HEAVINESS IN SYMBOLIC DYNAMICS: 
SUBSTITUTION AND STURMIAN SYSTEMS 



David Ralston 

Department of Mathematics, The Ohio State University 
231 W. 18th Avenue, Cohimbus, OH 43210, USA 



Abstract. Heaviness refers to a sequence of partial sums maintaining a cer- 
tain lower bound and was recently introduced and studied in [11]. After a 
review of basic properties to familiarize the reader with the ideas of heavi- 
ness, general principles of heaviness in symbolic dynamics are introduced. The 
classical Morse sequence is used to study a specific example of heaviness in a 
system with nontrivial rational eigenvalues. To contrast, Sturmian sequences 
are examined, including a new condition for a sequence to be Sturmian. 

1. Introduction 

Dynamical systems devotes much attention to the asymptotic behavior of points 
or other elements of a system. While asymptotic properties are extremely impor- 
tant, they are in a sense not observable; an observer monitoring a closed system 
can only ever observe a finite window of time. Suppose that an observer is ca- 
pable of monitoring the output of a function / over a finite portion of an orbit 
X, T{x), T'^{x), . . . , T^{x), and keeps a record of the associated partial sums. Finite 
observations do not lend themselves to discussion of limits, but any observer might 
be concerned with extremal behavior of the partial sums (a motivation similar to, 
but distinct from, that in the study of large deviations). With this restriction and 
motivation in mind, we define the heavy set (subject to various restrictions to be 
outlined later) to be those points in a system for which these partial sums maintain 
a natural lower bound over a natural collection of finite ranges. 

In applying these notions to symbolic dynamics, we will note a distinction be- 
tween systems with rational eigenvalues (§3) and a class of systems with no rational 
eigenvalues (§4). Specifically, the Morse sequence defines a nontrivial system with 
an abundance of heavy points, while Sturmian sequences have a scarcity of heavy 
points. Furthermore, Sturmian sequences are most frequently defined with a re- 
striction on the allowable weights of subwords, and heaviness will be concerned 
with establishing bounds on weights of words, so a connection between the two 
ideas is developed, most significantly in Theorem 7. 

I.l. Background and terminology. Before defining heaviness formally in §2, it 
is necessary to establish our framework and notation. Let {D,,fi} be a probability 
measure space (/i(il) = 1)- If T : ^ is /i-measurable, and /i(r~^r) = ^(r) 
for all /x-measurable F C 51, then we say that T preserves fi, and {Q,fi,T} is a 
probability measure preserving system. In this situation, let / G L^{^, jj). If the only 
functions / such that foT = / almost-everywhere are themselves almost-everywhere 
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constant, then T is called ergodic, and if /i is the only preserved probability measure, 
T is called uniquely ergodic. For u & fl, define 5'„(a;) recursively: Soioj) = 0, 
S'„+i(cj) = 5„(cj) + / o T^{uj) (if T is invertible, we may use this relation to define 
for all ?i e Z): 

n-l -1 

(1) 5„H = ^/orH (n>0), ^„M = -^/orH (n<0). 

'i=0 j 

In the vein of classical concerns of asymptotic behavior over infinite time frames, 
define: 

(2) /4u;) = liminfi5„M, 

n — »oo 77, 

noting that the Birkhoff Ergodic Theorem guarantees that lim„_»oo 'n~^Sni(^) exists 
almost everywhere. 

We may use the fact that T preserves /i to derive Sn{x)dfj, = n J^-^ fdfi. So, in 
line with our model of an observer of finite time periods, we define: 

Definition 1. The heavy set for f (relative to T ) between times m and n (m^nEZ, 
m < n) is given by: 

(3) Ti.^{m,n) = l^uj : Vi, m < i < n, Si{ui) — ^ J f'^f- 

In the common event that m = 0, the set is called heavy through time n. We use 
the shorthand: 

oo oo 
j=0 i=0 

to define the heavy sets over N and Z. Any use of negative times requires to 
exist. 

If T and / are clear from the context, we will simply write 7Y(m, n), 7Y(N), or 
H(Z). These sets represent points whose partial sums meet or exceed the average 
value of the partial sums over the range of prescribed times. Given the emphasis 
on finite time periods in defining heavy sets, it is worth pointing out that stating 
X e H{N) or X £ W(Z) should not be read as a statement about the behavior of 
Sn{x) on an infinite time frame, but rather about all finite times. We now present 
a pair of theorems regarding the existence of such points. 

Theorem 1. If {fl, ^,T} is a measure-preserving system, and f G L^{n,fi), then 
fi (Ti.ip{0,n)j > for any n G N. Furthermore, if T is invertible, then T is ergodic 

if and only if for any m <n, m,n gIj and f G L^{Vl,^i), ji (Ti.ip{m,n)j > 0. 

Proof. For a proof of this theorem, see [11]. □ 

Theorem 2. //{fi, /.i,T} is a continuous measure-preserving system on a compact 
probability space, then H^(N) ^ for any upper semi- continuous f G L^(f2,/Lt). 
Furthermore, if f is continuous andT is invertible and transitive, then 7Y;J,(Z) ^ 

Proof. For a proof of this theorem, see [11]. The first statement follows as a corollary 
from Theorem 1, but an alternate proof may be found as a lemma of Y. Peres 
[10]. □ 
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Remark. If T is closed, then (N) ^ (as xr is upper semi-continuous), but 
Ti^ {I) ^ is true in general only if F is clopcn (in this case, XA is continuous). 

Remark. It is a common mistake to assume that ^ |^7i;^(N)^ = 0. This claim 

is obviously false for functions / which are constant almost everywhere, for any 
function / on an atomic system (any nonempty set is of positive measure) , and in 
general for nonergodic T (see Corollary 1). 

In §3.1, we give an example of a uniquely ergodic system without atoms, and 
a function / which is not constant, such that /Li(7i(N)) > 0. However, as proved 
in [7], if T is ergodic, then for almost every a; G O, there exists some N = N{aj), 
< N < oo, for which S'Ar(a;) < N fdji. There is no contradiction between this 
fact and Theorem 1; if H(N) is not a null set, then for almost all lo e JiiH) there is 
some finite > such that S'Ar((jj) ~ N fdji. This situation will be investigated 
in §3.2. 

Proceeding to definitions specific to symbolic dynamics, the reader familiar with 
standard terminology may skip to §2, except to note the nonstandard Definitions 2 
and 3. We let an alphabet ^ be a subset of R, and elements of A are called letters. 
An element A G A"' is called a word of length n over A. We sometimes write 
^|. An element X £ A^ is said to be a sequence, and X G is a hi-sequence. 
In the frequent event that A = {0,1}, the word is called binary. The somewhat 
nonstandard definition of an alphabet to be an arbitrary subset of M is motivated 
by the canonical relation of sequences in A!^ to sequences {f{T^uj)}i=o,i.2,... m a 
probability measure preserving system {Q., T} along with / G (f2, ^) and w G 51. 
In this case, A is the range of /. By assuming to be compact, continuity of / 
implies compactness of A, and in the common scenario that / is the characteristic 
function of a set, A = {0, 1}. By relating points in A^ and sequences {/(TV)} (i = 
0, 1, . . .), then, the shift operator on the space Ai* is analogous to the transformation 
T : ^ fj, and heaviness statements about measure preserving systems in general 
may be interpreted as statements about heaviness in shift systems. 

For a binary word, we define the conjugate oi A = a^ai . . ., denoted A = oq . . ., 
by setting 0=1 and 1 = 0. For any word A = oq . . . a„_i of length n < oo over 
any alphabet, the transpose of A is denoted and defined by A^ = a,i-ia„_2 • . • ao 
(so that Af = An-i-i for i = 0, . . . , n — 1). Given two words A = oq . . . flm-i, 
B ^ bo . . . bn-i of finite lengths m and n, the concatenation of A and B is the word 
of length m + n given by AB = ao . . . Om-ibo ■ ■ ■ bn-i. Given a word A G A" , the 
weight and average weight, respectively, are: 

n — l ^ 

w{A) = Oi, wiA) = -wiA). 
^-^ n 

1=0 

A word A of length n is said to be a factor of another word (or sequence) B of 
length m > 71 if there is some j G N so that Oi = bi+j for i = 0, 1, . . . , n — 1. The 
complexity function for a sequence or bi-sequence X (over a finite alphabet A) is 
given by 

p{n) = e A" : Ais a factor of X}. 

A binary sequence X is said to be of minimal complexity if p{n) + 1 (any sequence 
X for which p{n) < n for some n is eventually periodic - see [4]). If A is a word of 
length n < oo such that there are two distinct letters a, (3 € A such that Aa and 
A(3 arc both factors of X , then A is said to be a right special factor. If there are 
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two distinct letters a and /3 such that a A and (iA are both factors of X, then A is 
said to be a left special factor. 

Given a sequence X 6 Jl^ or J^, we define the sequence (t{X) by cr(X)„ = Xn+i- 
If ^ is compact, then so are and (in the product topology), and therefore 
f2 = {cr^Xj^Q is compact. It is seen that cr is now a continuous map of a compact 
space. The system generated by the sequence X is the topological dynamical system 

Definition 2. Let A = oq . . . a„_i and < i < j < n. Then A^.j ~ ai . . . aj_i is 
a word of length j — i, beginning at index i (note that A = Ao^n, and A^.i is the 
empty word of length zero) . 

Definition 3. Let A = qq . . . a„_i be a word of length n, over alphabet A. Then 
the reversal of A is the word over the alphabet —A^ {—a : a e ^}, defined and 
notated by 

p{A)i = -an-i-i, i = 0, 1, . . . , n " 1. 
That is, p{A) is the transpose of A, with a negative sign on all entries. For n < 0, 
p(A„,o) = (-a-i)(-a-2) ■ • • (-a«)- If we define -A = (-ao)(-ai) . . . (-a„_i), 
then p{A) = -(A"^) = (-^)^- For any word A, w{A) = -w{p{A)). 

Remark. Compare with (I), our partial sums over negative times. By defining 
/ : — > M by f{X) = xq, and cr is the shift operator, for n > 0, Sn{X) = 'w{Xo^n), 
and for n < 0, 5„(X) = w{p{Xn,o)) (for n = 0, both equal 0, the weight of the 
empty word). This relation is the motivation for defining p. 

2. Heaviness in symbolic dynamics 

There are two fruitful ways to define heaviness in symbolic dynamics. The first, 
a-heaviness (§2.1), is a direct analogue of the definition of heaviness in Theorem 1. 
The second way to view heaviness, local heaviness (§2.2), is more combinatorial in 
nature. In presenting both views, we will spend some time to familiarize the reader 
with the definitions by presenting several theorems regarding the existence of such 
phenomena in very general settings, before we proceed to considering any specific 
systems. 

2.1. a-Heaviness. In situations where we are interested in a global target for heav- 
iness, some fixed a which will act as a lower bound on our partial averages, we 
proceed as follows: 

Definition 4. Let A = oq . . . On-i- A is said to be a-heavy (a-light) if for all 
l<i<n: 

(4) w{Ao,i) > a iw{Ao,i) < a). 

If X is a sequence, then X is a-heavy (a-light) if for all i G N: 

w{Xo^i) > a {w{Xo^i) < a). 

Remark. Trivially, if A is a-heavy (or light), then so is the initial factor Aqj for all 
< j < m. 

The following lemma is not difficult, but will be of great use in §4.1: 
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Lemma 1 (The Reversing Principle). Assume that A is of length ti + 1, such that 
A ^ gq . . . a„-i is a-heavy, but 

w{ao . . . a„„ia„) < a. 

Then the word (ao . . . a„)^ = a„a„_i . . . oq is a-light. Equivalently, the word 
p{Aa^n+i) is {~a)-heavy. 

Proof. Assume that there is some i G {0, 1, . . . ,n} such that ?U(a„a„_i . . . a„_i) > 
a. We may assume i ^ n. Then as W(ao . . . a„_i) > a (the word oq . . . a„-i was 
a-heavy), we clearly have W(ao . . . a„) > a, a contradiction. □ 

^From this point, we refrain fr'om statements in terms of both lightness and 
heaviness; we refer only to heaviness properties, but analogous statements regarding 
lightness are all possible. Recall that a set ^ C M is well-ordered by '>' if it 
contains no infinite increasing sequence. We take '>' to be our standard ordering; 
for lightness, '<' would be the relevant ordering. 

Lemma 2. Let A C R be well-ordered. Then for every n G N, the set Bn = 
{w{A) : A G A"'} is also well-ordered. 

Proof. It suffices to prove that if both A and B are well-ordered, then so is the set 
{A + B) = {'J : J = a-\- (3, a e A, 13 e B}: our B„ are subsets of (^-|-^-|-. . . + A)/n, 
which is well-ordered if and only if (^+^+. . .+^) is well-ordered. To the contrary, 
let 7i,72, . . . be an increasing sequence in ^ -I- i3, where 7^ = -f /3i, G A, 
j3i G i3. As ^ is well-ordered, we may pass to a subsequence ^n{i) such that a„(i) is 
monotonically decreasing (not necessarily strictly): define 

n{i + 1) = min{n > n{i) : a„ ~ max{aj : j > ri{i)}} 

and see that an{i+i) < ctn{i)-, ^nd each n{i) is defined by well-orderedness of A. As 
jn{i) E^re increasing, and the Q!„(i) arc nonincrcasing, the must be an infinite 
increasing sequence in B, a contradiction. □ 

We now investigate just how frequently one may expect to find a-heavy factors 
of arbitrary sequences, depending on the target value a. 

Lemma 3. Let a > —00 and X be a sequence such that: 

liminf u'(Aro,n) = ct. 

Then for any S < a, there is an N E N such that the sequence Xn^oo = x^xn^i . . . 
is S-heavy. 

Proof. Fix S < a, and assume to the contrary that for every N, there is some f{N) 
such that w{Xpf f(^]^-j) < 6. Set ko = and recursively define ki = f(ki-i). Then 
represent X = Xkgj(ko)^kij{ki) ■ ■ ■■ It is seen that for all i, ?U(Xi j(j..)) < S < a, 
contrary to our assumption that liminf„^oo i^(Xo.„) — a. □ 

Corollary 1. Let {fl,fj,,T} be a probability measure preserving system which is not 
ergodic, and let f G L^{fl,^) be such that /*(a;) is not almost everywhere equal to 
a constant. Then /i {Ti.(N)) > 0. 

Proof. As Jj^j f^,d^ ~ fd/J,, let T = {lo : /*(aj) > fd^i}. By assumption, /i(r) > 
0, and Vw G F, 3N such that T^{uj) G H(N) (by Lemma 3). As F is covered by the 
preimages of 7Y(N), fi{T) > 0, and T preserves fi, we must have that fi{'H(N)) > 
0. □ 
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It is not difficult to construct a sequence X of finite upper density: 

limsup wJ(Xo,„) = a < oo, 

n — >oo 

such that X does not have arbitrarily long a-heavy factors (for example, 
Xn/{n + 1) will construct a sequence of upper density one with no 1-heavy fac- 
tors whatsoever), but the following lemma will extend the idea of Lemma 3 as far 
as possible: 

Definition 5. Define the upper Banach density of X, for X G J^'^ or by 

d*B{X) = limsup ^{XaiM)- 

ibi—ai) — >oo 

Theorem 3. The alphabet A has the property that every X G contains arbi- 
trarily long d*g{X)-heavy words if and only if A is well-ordered. 

Proof. First, assume A is well-ordered. Then for any X G A^, d*g{X) < oo. If 
d'^{X) = — oo, there is nothing to prove, so assume d'^{X) G K, and fix some 
5 < d*g{X). Assume that there is some iV < cx) so that for every i, there is some 
f{i) < N for which j+jjj)) < S. Similarly to Lemma 3, represent X as a string 
of concatenated words of average weight strictly less than S, but note that there is 
now have a universal bound on the length of the words. It follows that d*^{X) < 6: 
for very large bi — Oi, words of length bi — Oi may be considered as a concatenation 
of factors of length no larger than N, of average weight less than S, plus small extra 
pieces at the end of bounded length and weight. So, X must contain arbitrarily 
long (S-heavy factors for arbitrary S < d*g{X). 

Now assume that there is an X and a bound on the length of any d*g{X)- 
heavy factors of X. For any e > 0, let s = s(e) be a factor of length N which 
is {d*g{X) — e)-hcavy (but by assumption, not d*g{X)-h.ea.vy). Define a decreasing 
sequence by fixing an arbitrary eo > 0, and defining: 

d*giX)~w{S{e,)) ^ 
Q+i - ^ > U 

Continuing this process, create a sequence of words {S{€i)}°^Q of length N whose 
average weights are strictly increasing. By Lemma 3, there is a contradiction. 
Therefore, X must contain arbitrarily long d|j(A)-heavy factors. 

The proof of the converse is much shorter: let {a^j^Q be a sequence in A which 
is strictly increasing. Then X = aoai . . . does not contain any d|j(X)-heavy factors 
of any length. □ 

Corollary 2. Let X G A^'^ , where A is a well-ordered and compact subset of M.. 
Then under the transformation a, the space 0+(A) contains some X' which is 
dg{X)-heavy. 

Proof. Consider any limiting sequence X' of the words x{n), dg{X)-hea,vy length n 
factors of X (these x{n) exist by Theorem 3). By construction, d*g{X') = d'^{X), 
and 

infW(X^,„) = limsup w(A|^,„). □ 
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2.2. Local Heaviness. In the following section, we introduce a version of heaviness 
which docs not depend on an arbitrary constant a: 

Definition 6. A word A = aofli . . . a„_i is said to be heavy, or locally heavy, {light, 
or locally light), if for all 1 < i < n: 

w(Ao,,) > w{A) (w{Ao,^) < w{A)). 

Equivalently, for all 1 < i < 7i — 1: 

w{Ao^i) > w{Ai^ri) {w{Ao,i) < w{Ai^ri))- 

A sequence X is heavy (light) if for every i > 0: 

w{Xo^i) > limsupw(Xo,ri) {w{Xo^i) < liminf ?«(Xo,„) ) . 



The word A is locally heavy if and only if A is w{A)-hca.vy. However, the 'target 
value' in this case varies with the word in question, whereas in definition 4 there 
was a preordained a. In the case A = {0, 1}, light words are called Lyndon words, 
an object of study in combinatorics and computer science (see [9]). Again, however, 
we will suppress statements regarding light words. 

Remark. In contrast to a-heaviness, it is generally the case that for a given heavy 
A of length n, there may be some 1 < i < n for which ^o,i sie not be locally heavy. 
Consider the word 1010, which is heavy, and the initial factor 101, which is not. 

Lemma 4. Let A and B be heavy words. Then the concatenation AB is (locally) 
heavy if and only ifW{A) > W{B). 

Proof. The necessity is obvious: let \A\ and \B\ = m. liw{A) < w{B), then: 

w{{AB)o,n) = w{A) <w{B)^w{{AB)^^^+n). 
Now, assuming w{A) > w{B), we see that for i < n: 

w{{AB)o,^) > W{A) > w{AB) 

and for i > n: 

W {{AB)o^i) > w{Bo.i^n) > w{Bi^n,Tn) = W {{AB)i^n+ni) ■ □ 

Theorem 4. Fix an alphabet A. Then every X G contains arbitrarily long 
heavy factors if and only if A is well-ordered. 

Proof. First, assume that A is well-ordered. Let X G A^ and < oo be such that 
X contains no heavy factors of length longer than A^. Then represent AT as a chain 
of heavy words in the following manner: 

Let Ai = a;o...x„j_i be the longest possible heavy factor beginning at xq. 
By assumption, ui < N. Let A2 ~ Xm ■ ■ ■Xni+n2-i be the longest heavy factor 
beginning at Xm and again note that 712 ^ N . Continue in this manner to write 
X = A1A2A3 . . . where each \Ai\ < N. In light of Lemma 4, the average weights of 
these blocks must be strictly increasing. Furthermore, because the length of each 
An is bounded, there must be some specific length which occurs infinitely often, so 
there is an infinite collection of words of the same length, with strictly increasing 
average weight. By Lemma 2, this is impossible. 

Now, suppose that A has an infinite subsequence {ai}^o which is strictly in- 
creasing. Then the sequence X = aoai ... is seen to have no heavy words of length 
longer than one. □ 
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Remark. We do not claim that every X has heavy factors of every length! Consider 
the sequence 101010 . . .; the alphabet {0, 1} is certainly well-ordered, but X does 
not have any heavy factors of odd length larger than one. 

3. The Morse-Thue sequence and substitution systems 

In this section, we will define the classical Morse sequence, using it as an example 
to discuss certain aspects of heaviness. After discussing this sequence, we will make 
brief remarks extending these properties to a general class of substitution systems. 

3.1. Heaviness in the Classical Morse(-Thue-Prouhet) Sequence. The clas- 
sical Morse sequence may be built in the following manner. Let -A/(0) = 0, and 
M{i + 1) = M{i)M{i). The sequence M such that A/o,2" = M{n) is the one-sided 
Morse sequence: 

M(0) = 0, M(l) (0)(0) = 01, M(2) = (G1)(0T) = 0110, Af(3) = (0110)(0TT0), . . . 

and the two-sided Morse sequence is given by the word M ~ M'^M, centered about 
M(0) = M(0) (the decimal point appears to the left of Mq)- 

A/ = .0110100110010110... 

M = ... 0110010110.0110100110 .... 

For a survey of this history of this interesting sequence, including numerous 
applications and information on the many independent formulations, see [1]. It is 
easily seen that M2iM2i+i G {01, 10} (including i < in the case of M). 

Lemma 5. Let A be a word of length 2k < n < 2k + 2 which is a factor of either 
M or M. Then k < w{A) <k + 2. 

Proof. For a word of the form A' = M2i^2{i+k)i w{A') = k. Then A can only be of 
the form A', M2i-iA' , A' M2(^i+k)i or M2i-iA' M2(i^k)- By considering all choices 
(0 and 1) for values of A'hi-i and M2[i+k), establish the inequality. □ 

Corollary 3. Let X be any sequence which is a factor of the Morse sequence such 
that Xo^2 = 11- Then X is ■^-heavy. Similarly, if X is a sequence which is a factor 
of the Morse sequence, and X begins with 00, then X is ■^-light. 

Proof Pick an initial word Xq^ of the form IIX' where 2k < \X'\ <2k + 2. Then 
w{Xo,i) = 2 + w{X') > fc + 2, sow{Xo.j) > \. The proof is similar for Xq, 2 = 00. □ 

We now appeal to a well-known result: the system generated by M is uniquely 
ergodic (see [6, Ch. 5]). The unique invariant measure /x assigns [i{Y) = i, where 
r = {tj e O+Jm) : ujo = 1}, and ^(A) > 0, where A = : woOJi = 11} (in fact. 

Corollary 4. In the system {0+(Af ), /i, ct}, with the function xr, where 
T = {lu : ujQ = I}, we have 11 (HJ'"(N)) > /i (7i^'"(Z)) > 0, where M is the two-sided 
Morse sequence. In the one-sided sequence space generated by M , /i (HJ'"(N)) > 0. 

Proof. We have seen that A C ?i^'"(N), where A = : ujquji = 11}, and we have 
already seen that ^(A) > 0. Therefore, 7iJ'"(N) is of positive measure. Similarly, in 
the two-sided sequence, ?i^'"(— N) = fli^i '^a^{'~h 0) is of positive measure; infinite 
words which end in 00. Therefore, in the two-sided sequence, any bisequencc X with 
X-2X-iXoXi = 0011 will be in H^^{Z), and the set A' — {X : a;_2a;-ia;oa;i = 0011} 
set is also seen to be of positive measure (to be precise, /i(A') = j^). □ 
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3.2. General (Non-Mixing) Substitution Systems. We begin this subsection 
with the following theorem: 

Theorem 5 (Halasz [7]). Let {Q,T,^} be a probability-measure-preserving system. 
The value E{a) = e}™'^ belongs to the spectrum of T if and only if there exists a 
set r of measure a such that for ji-almost every lo £ Q, for all n G N; 

(5) |5„H-na|<l, 

where f{uj) = XAii^)- 

Corollary 5. Let {f2,^,T} be a probability-measure-preserving system, with a G Q 
such that E{a) is in the spectrum ofT. Then, letting /(w) = xri^), where T is a 
set which satisfies (5), fi{Ti.(N)) > 0. 

Proof. The quantities 5,1 (w) — na are bounded almost everywhere (from Theorem 
5) and discrete (^^(cj) £ Z, and a £ Q). Therefore, the minimum value of the 
sequence {Sni^^) — ?^a}^Q is achieved at some minimal time N{lu) for almost all 
w G n. So, w G T-"^'^) (H(N)). As 

^(n\[jT"^niN))^=o, 

we must have ^ (H(N)) > 0. □ 

We will provide a brief overview of a class of systems in symbolic dynamics with 
rational eigenvalues, as well as illustrating why satisfying (5) with an irrational 
eigenvalue does not guarantee positive measure heavy sets. 

The Morse sequence may also be viewed as a fixed point of the substitution 
defined by 01 and 1 10: 

^ (01) = 01 ^ (01)(10) = 0110 (01)(10)(10)(01) = 01101001 ^ ... 

In general, define a substitution system E on f2fc = {0, 1, . . . , fc — 1} by assigning 
S(a) G for all a G fife, and n{a) < oo for all a (that is, E assigns a word to 

each letter). The substitution matrix for S is: Aij ~ #{j in S(i)}. In the case of 
the Morse sequence, the matrix is given by Aij = 1. 

Two different substitution systems might have the same matrix. However, if the 
matrix A is primitive {3n : A^j > yi,j), then the shift map defines a uniquely 
crgodic system {ri,^} on some limiting sequence X G {I]"(a;o)}5^Li such that 
= X for some iV G N (see [6, Ch. 5]). 

In the event that cr is a substitution of constant length {n{a) is constant over Qk), 
and X is a periodic point under the substitution S, then the system {0+(X), ^, a} 
has nontrivial rational eigenvalues (see [6, Ch. 7]), and therefore there arc nontrivial 
/i-integrablc functions / : 0+(X) M with positive-measure heavy sets (in light of 
Corollary 5). 

Example 1. Fix A = {0,1,2} and i;(0) = 120, I](l) = 201, i;(2) 210 (note 
that each S(a) is a 1-heavy word of length 3 and average weight 1). Let X be 
the invariant limiting sequence lim„^oo S"(2) = 210201120210120..., and create 
a uniquely crgodic shift system {f2,/i, cr} (the substitution is primitive). If we are 
interested in making the system invertible, we may create the natural invertible ex- 
tension, as outlined in [3, pp 239-241]. Define f{u!) = Xq. The following progression 
is extremely similar to that carried out in §3.1, and details are omitted. 



10 



DAVID RALSTON 



First, to establish a 'target,' compute: fd^ = 1. Now, because ^3fc,3(fc+i) are 
always of average weight 1 , consideration of blocks which may precede or follow these 
'evenly weighted' blocks (for example, a;3fc+ia;3fc+2 ^ 21) yields that \Sn{X) — n\ < 1 
for any sequence X G ^l. So, any factor which begins with 2, 12, or 112 will be 
heavy, and if ^ is a factor ending with 0, 01, or Oil then p{A) is — 1-heavy: 

n(N) D {X : xo^2, xoxi = 12, xoXiX2 = 112}, 

n{Z) D {X ; a;_2X_ia;oXi G {1012,1021,0112,0121}}. 

Finally, quick density computations verify that yu (7i(N)) > /i (7i(Z)) > 0. 

To see the importance of rational eigenvalues to produce positive-measure heavy 
sets, let a ^ Q and consider the circle rotation {S^ , fi, Ra}, where is Lebesguc 
measure and Raii^j) — lu + a mod 1. Fix F = [0,0?) and /(w) = xrC"^)- Then 
|S'„(a;) — na\ is bounded [8]. However, as a ^ Q, these S'„(cj) — na do not necessarily 
ever achieve their infimum. Indeed, we will see in Corollary 6 that (N) is exactly 
one point, and therefore a null set. 

4. Sturmian sequences 

Definition 7. A binary word, sequence, or bi-sequencc X is called Sturmian if 
factors of the same length differ in weight by at most one. 

Remark. In a Sturmian sequence, the density of the sequence exists [6, Ch. 6]: 

d{X) = lim wiXo.n). 

n — >oc 

We restrict our attention to those Sturmian X of density a ^ Q; Sturmian sequences 
of rational density are eventually periodic. 

4.1. a-Heaviness in Sturmian Sequences. We note the following theorem: 

Theorem 6 (E. Coven, G. Hedlund [4]). A bi-infinite binary sequence X of minimal 
complexity (p{n) + 1) is Sturmian if and only if for every A which is a factor of X , 
A"'" is also a factor of X . 

and use it to derive the following: 

Theorem 7. A bi-infinite binary sequence X is Sturmian of irrational density a 
if and only if for any n S N, there is a unique factor A of length n such that A is 
a-heavy, and a unique factor A of length n such that p{A) is {—a)-heavy. 

Proof. Assume X is Sturmian, and let a ^ Q be its density. Then by Theorem 3, X 
contains arbitrarily long a-heavy factors {A = {0, 1} is certainly well-ordered). It 
is also seen that X contains as factors arbitrarily long transposes of a-light factors 
(using the 'a-light' version of Theorem 3, and noting that X-1X-2X-3 . . . is also 
Sturmian of density a). So, X has arbitrarily long factors A such that p{A) is 
(— a)-heavy. Therefore, there exists at least one word of each length which satisfies 
our criteria. 

Let A, B be two distinct factors of length n which are a-heavy. Assume they 
are of minimal length n > 0, so that A = CO and B = CI for some factor C. 
Then w{A) and w{B) are the two possible weights for factors of length n in X, 
and both weights are at least as large as na. As a ^ Q, both are strictly larger 
than na. Therefore, every factor of length n has average weight larger than some 
a + e, contradicting the fact that X was of density a. The proof is similar for two 
factors of length 71 > whose reversals are (— a)-heavy. If a G Q, the result does 
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not hold: consider X = ... (10)(10)1(10)(10)(10) . . .. This sequence is Sturmian, 
and the factors 10 and 11 are both 1/2-heavy. 

For the converse, it wiU suffice, in Hght of Theorem 6, to show that X is of 
minimal complexity and all transposes of factors are also factors. Assume that the 
following conditions all hold for 1 < i < n — 1 (they are easy to verify for i = 1): 

I. There is a unique a-heavy factor of length z, and a unique factor of length 
i whose reversal is (— a)-heavy. This condition has been assumed for all i. 

II. If ^ is a factor of length i, then is a factor. 

III. The factor A of length i — 1 is a right (left) special factor if and only if lA^ 
is a-heavy (p{lA) is — a-heavy). 

By combining (I) and (II), X contains unique a-light factors of length i, and 
unique words of length i whose reversals are (— a)-light. Adding (III), p(n — 1), as 
there is a unique right (or left) special factor for all i < n — 1. Establishing (II) 
and (III) for all ?i, then, would ensure both minimal complexity and admissibility 
of transposes, sufficient to show that X is Sturmian. 

Assume, then, that there is a factor A of length n > 2 such that A'^ is not a 
factor. Then, by our inductive hypothesis, Aq„_i and Af^ are both factors. As 
X is a bisequcncc, both of these words have precursors and successors (they do not 
begin or end X), so the words 

a„_i^o,ji-i a„_i^^„_iao and ^^„a^J = a„_i A^„_;^ao 

are factors. Note, then, that is both a left and right special factor. Therefore, 

by (III); 1j4i^„_i is the unique a-heavy factor of length n — 1. and ^i^„_il is the 
unique a-light factor of length 1 — n. Then 

(lAi,„_i = {Ai^n-ilf) =^ {Al„_^ = Ai,„_i) . 

As A'^ is not a factor, A ^ A^ , so ao ^ a„_i. Without loss of generality, let ao = 1 
and a„_i = 0. Let B = = Ai_„_i, for convenience. 

The following have been shown to be factors of X: IBO, QBO, and IBl. As IB 
was a-heavy, IBl is the unique a-heavy factor of length n. Therefore, IBO is not a- 
heavy, and by the Reversing Principle (Lemma 1), the word p{lBO) is (— a)-heavy. 
Then certainly p{OBO) is (— a)-heavy and of the same length, contradicting (I), our 
original assumption. Therefore, (II) holds for factors of length n as well. 

It remains only to show (III) for factors of length n. Let A be the unique right 
special factor of length n — 1; Al and AO both appear. Inductively, then, lA^ is 
the unique a-heavy factor of length n — 1. However, by (II), both lA^ and OA^ 
appear. So, if lA^ is not a-heavy, it follows (again, by the Reversing Principle) 
that Al and ^0 are both a-light, so that the reversals of the transposes are both 
(— a)-heavy, contradicting (I). 

In the reverse, let lA be the unique a-heavy factor of length n. Then lAo,„_2 is 
the unique a-heavy factor of length n — 1, so n-2 is a right special factor. If A'^ 
is nota. right special factor, then a„_2^i'„_2 is, and the previous reasoning would 
ensure that lAi.„_2a„_2 would be a-heavy, contradicting (I). 

So, there is a 1 — 1 correspondence between a-heavy factors of length n (which 
exist and are unique) and right special factors of length n — 1. Therefore, p{n) + 1 
for all n, and as X contains as factors all transposes of factors, X is Sturmian. □ 
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Remark. The heart of the above theorem is that Sturmian words can be character- 
ized as having unique special factors, and when special factors are unique, they can 
be characterized by a heaviness condition. 

Corollary 6. Let X be a Sturmian sequence (bisequence), of density a ^ Q. Then 
0^{X) contains exactly one sequence (bisequence) which is a-heavy. 

Proof. Let x{n) be the a-heavy factor of X which is of length n. It is seen that for 
m > n, m,n £ N, x(m) ~ x{n)A for some word A of length m — n. Therefore, in 
the compact space {0, 1}^, let 

X' = lim x{n). 

n — *oo 

The sequence X' is unique, is in the system generated by X, and X' uniquely 
extends on the left as a Sturmian bisequence (see [6, Ch. 6]). □ 

4.2. Local Heaviness in Sturmian Sequences. We will now approach the sub- 
ject of Sturmian sequences using local heaviness (Definition 6), rather than a- 
heaviness. 

Lemma 6. There is exactly one heavy Sturmian word of length n and weight m for 
any choice < m < n < oo. 

Proof. For convenience, denote a word of length n and weight m as type (rn,n), 
and the claim is apparent if m = or n = 1. Assume, then, that the claim is true 
for all words of length smaller than n. 

Let A be Sturmian and heavy, of type (pfci,pfc2), where ki and k2 are relatively 
prime. Then applying the pigeonhole principle and Lemma 4, A = A1A2 . . . Ap, 
where each Ai is Sturmian and heavy, of type (fci,A:2)- Therefore, it is sufficient to 
prove the claim in the event when m and n are relatively prime. If m > n/2, A 
must be of the form l"°01"iO . . . 1"'=0, where the Sturmian condition requires that 
each Tii = iV or iV + 1 for some N. As (m, n) = 1, both values occur (if all Ui = N, 
for instance, n = (fc + 1)(A^ + 1) and m = (A: + 1)A^), and the heaviness condition 
requires that no = + 1 and = A^. Define /(l"'0)i — TV, and associate to A the 
smaller word 

B = /(l""0)/(l"iO).../(l"^'0). 

We now show that B is Sturmian. Let B' and B" be factors of B of equal length 
such that w{B') = w{B") + 2. Then consider the two factors of A, f{A') = B' 
and f{A") = B": w{A') = w{A") + 2, but \A'\ = \A"\ + 2 as well. However, 
by assuming the minimality on the length of B' and B" , B" begins with a zero. 
Therefore, A" ^ Aq^n; A" is preceded by a zero. Let Ci = OA". Also, the last 
element of A' must be a zero, so let C2 = A'^ \A'\-i- Then w{C2) = w{Ci) + 2, but 
|Ci| ~ 1^*21, contradicting the assumption that A is Sturmian. 
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Now, to show that B is heavy, begin with knowledge that A is heavy and recaU 
= iV + /(l"'0): 

w{r°o . . . r^-'O) > w(i"'0 . . . i"''-io) 

-1 \ / k-1 \ / k-1 \ / i-1 

Ek + i)U + 

{iN + w{Bo,r)) {{k ~ + 1) + w{B,^k)) > {{k - i)N + w{B^^k) {i{,N + 1) + w{Bo,,)) 

(k - i)w{Bo,i) > iw{Bi^k) 
w{Bq^,) > w{B,,k) 

So _B is a heavy Sturmian word of smaUer length, and by the inductive hypothesis 
B is unique, and therefore A is unique. 

The proof works similarly for m < n/2, by considering the lengths of blocks of 
zeroes. □ 

Corollary 7. Let X be a Sturmian sequence which contains at least two ones and 
two zeroes. Let N = max{n : X contains 1" or 0" as a factor}, noting that our 
assumptions guarantee N < oo (and either 11 or 00 is not a factor X). Then X 
has exactly two distinct heavy factors of lengths n < N , and at most one heavy 
factor of all other lengths. 

Proof. All factors of X are Sturmian, and given a fixed length n, there are at 
most two weights possible for factors of length n. It is therefore clear in light of 
Lemma 6 that for any n, there are at most two heavy factors of length n. Assume 
that d{X) > 1/2, so that N is the longest string of ones. Then 1^0 (where 1^ 
represents a string of N consecutive ones) is a factor, so l^-^-iQ and for 
i = 0, . . . , iV — 1 are also factors, giving two heavy factors of lengths 1, . . . , iV. The 
proof is similar if TV is the length of the longest string of ones. 

If X has two distinct heavy factors of length n > N, then as they are Sturmian 
heavy words and distinct, they must be of two different weights, and the weights 
therefore differ by one. Let B and C be the two factors, and let w{B) = w{C) + 1. 
That n > N ensures that each of them begins with a one and ends with a zero; a 
heavy binary word containing both 1 and must begin with a one and end with 
a zero. Then it is seen that w(i?o,Ti-i) = w{B) = w{C) + 1 = w{Ci^n) + 2, which 
contradicts that X is Sturmian. □ 

Corollary 8. Let X be a Sturmian sequence whose density is a ^ Q. Then there 
is a unique locally heavy sequence in 0~^{X). 

Proof. Recall that for a sequence to locally heavy is, by definition, the same as the 
word being limsup„^3^T/;(Xo,ri)"heavy. We appeal to Corollary 6. □ 

We conclude with a remark on the construction of heavy and a-heavy Sturmian 
words. Given an a S [0, 1] and n £ N, define = [za] — [(z — l)a\ for i = 0, 1, . . . , n— 
1, where [a;] = max{n £ Z : n < u}. Then A = ao ■ . ■ a„_i is the unique Sturmian 
word of type {w{A), \A\), and if a ^ Q, A is the unique a-heavy factor of the infinite 
Sturmian sequence of density a. The sequence is related to the spectrum of a. 
See [5] and [2]. 
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